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ABSTRACT 


Likelihood  ratio  tests  concerning  the  parameters  of  two 
multinomial  populations  are  discussed.  A  stochastic  ordering 
restriction  is  considered  as  a  one  sided  alternative  to 
equality.  The  one  and  two  sample  tests  for  equality  versus 
stochastic  ordering  and  stochastic  ordering  versus  all 
alternatives  are  derived  and  their  large  sample  distributions 
are  obtained.  The  large  sample  distributions  are  mixtures  of 
chi-squared  distributions.  The  tests  developed  provide 
discrete  analogues  for  the  one  sided  Mann-Whitney-Wilcoxin 
and  Kolmogorov-Smirnov  tests. 
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1 .  INTRODUCT ION .  Tests  for  the  equality  of  two  populations 

against  a  stochastically  ordered  alternative  are  among  the 

more  widely  used  nonparametric  procedures.  They  include  the 

one  sided  Mann-Whitney-Wilcoxon  and  Kolmogorov-Smirnov  tests. 

We  consider  analogous  one  and  two  sample  likelihood  ratio 

procedures  under  the  assumption  that  the  underlying  populations 

are  discrete.  It  is  well  known  that  one  sided  procedures  are 

more  powerful  then  their  two  sided  counterparts.  Thus  these 

procedures  are  recommended  over  the  standard  chi-squared  tests 

provided,  of  course,  that  the  underlying  assumptions  are  valid. 

We  denote  the  two  collections  of  multinomial  parameters 

by  p  =  (plfp2, . . . ,Pk)  and  q  =  (q1»q2» . . . ,qk>  and  we  assume 

that  both  p  and  q  are  in  A  =  {  (x,  ,x0, . . .  ,x.  ) :  x.  >  0,j;  x.  =  1}. 

x  /  ic  i  i=l  ^ 

Consider  the  hypothesis  that  the  q  distribution  is  stochastically 

larger  than  the  p  distribution.  Specifically, 

i  i  k  k 

(1.1)  H.:  ][  P-s  I  i  =  l»2,...,k-l,  J  P-s  *  I  *I.i  * 

1  j=l  3  j-1  3  j=l  3  j=l  3 

If  (1.1)  holds  we  say  that  p  majorizes  q  and  denote  this 

symbolically  by  p  >>  q.  The  three  hypotheses  to  be  considered 

here  are  HQ:  p  =  q,  and  H2  =  -  H1  (not  H^)  and  we  shall 

consider  both  one  and  two  sample  tests. 

Chacko  (1966)  studied  a  likelihood  ratio  statistic  for 

testing  the  null  hypothesis  that  p  =  qQ  =  k_1 (1,1, . . . ,1)  against 

the  alternative  that  p^  p2  *  *  *  —  Pk  (an<*»  course,  p  +  qg). 

The  hypothesis  p  >>  k”1 (1,1, . . . ,1)  is  implied  by  the  hypothesis 

Pi  i  ?2  1  •••  1  Pk'  but  not  conversely,  so  that  the  test 

discussed  here  for  testing  p  *  qQ  against  the  alternative 


j 
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p  >>  qQ  has  a  less  restrictive  alternative  than  that  considered 
by  Chacko.  It  is  interesting  to  note  that  the  statistic, 
derived  in  Section  3,  for  testing  p  =  versus  p  >>  q^  has  a 
chi-bar- squared  distribution  as  did  Chacko* s  test  statistic. 
Robertson  (1978)  generalized  Chacko* s  work  by  considering  the 
test  of  p  =  q  (arbitrary  q)  against  an  arbitrary  order  restric¬ 
tion  on  p.  He  also  considered  the  problem  of  testing  an  order 
restriction  on  p  as  a  null  hypothesis. 

In  Section  2,  the  one  and  two  sample  maximum  likelihood 
estimates  of  the  multinomial  parameters  subject  to  the  restric¬ 
tions  in  are  derived.  The  distribution  theory  for  the  one 
sample  tests  of  HQ  versus  -  HQ  and  versus  H2  is  given 
in  Section  3  and  Section  4  contains  the  corresponding  two 
sample  theory. 

2.  RESTRICTED  MAXIMUM  LIKELIHOOD  ESTIMATES.  In  order  to 

develop  the  desired  likelihood  ratio  tests  we  must  first  obtain 

the  maximum  likelihood  estimates  under  the  restriction  p  >>  q. 

The  approach  uses  the  theory  given  in  Section  5  of  Barlow  and 

Brunk  (1972)  which  requires  the  following  notation.  For  any 

collection  of  positive  weights,  w  =  (w^,w2, . . .  ,\iy) ,  let  (x,y)w 

be  the  inner  product  on  R  defined  by  (x,y)  ■  \  x.y.w. ; 

w  i _i  111 

let  ll*llM  denote  the  induced  norm  (ie.  | |x| |  *  l  xf  w. ) ; 

,  i=l  1  x 

and  for  any  subset  A  of  RK  let  Ew(x|A)  denote  the  projection 

(ie,  closest  point  under  I  I  * llv)  of  x  onto  A  provided  it  exists 

and  is  unique  (cf.  Brunk  (1965)). 

We  first  consider  the  one  sample  problem.  Assume  q  is 

known;  assume  a  random  sample  of  size  m  from  the  population 
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associated  with  p  and  let  p  =  (Pi'P2'  *  *  *  '?k>  136  the 

A 

vector  of  relative  frequencies  (ie.  mp  has  a  multinomial 
distribution  with  parameters  m  and  p) .  Let  C=  {xeR^s  x^^  •>  x2^.. 
and  note  that  C  is  a  closed  convex  cone  in  R  ,  so  that  by  Brunk 
(1965),  Ew ( *  | C )  is  well  defined. 

/V 

Theorem  2.1  If  p^  >0;  i  *  l,2,...,k,  then  the  maximum  likeli¬ 
hood  estimate  of  p  subject  to  is  qiven  by 

(2.1)  P  -  pE- (q/p|C) , 

where,  for  x,yeRk,  xy  denotes  the  vector  (x^yirx2y2» • • • »xkyk) 
and  x/y  ■  (*1/y1,x^y2> . . .  #Xj/yk) . 

Before  the  proof  of  Theorem  2.1  is  given,  we  describe  the 
lower  sets  algorithm  (LSA)  for  computing  Ew(x|c).  For  A  a 
nonempty  subset  of  {l,2,...,k},  set 

M(A)  -  IiEA  “iVluJl  Wi‘ 

Set  ig  =  0  and  choose  i^  the  largest  positive  integer  i  which 
maximizes  M({iQ+l, . . . ,i}) .  Next  choose  i2  the  largest  integer 
i  greater  than  i^,  which  maximizes  M({i^+1, . . . ,i}) .  Continuing 
this  process,  we  obtain  0  *  iQ  <  ij^  <  . . .  <  i^  »  k  and  the 
projection 

Ew(x|c) i  =  M({i^_1+l,...,ij})  for  i£{ij_i+l, • • • ,ij >  and 
j  m  1»2,...,£. 

The  sets  {i^_^+l, . . . ,ij  }  are  called  the  level  sets. 

Proof.  The  m.l.e.,  p  *  (Pj/P2' • • • »Pk) »  solves  the 
following  optimization  problem: 

k 

minimize:  -  l  mp.  In  p.  subject  to  p  >>  q. 
i«l  1  1 
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—  ^  ^  A  A  A  A  A 

Set  s  —  in  (P^/P^ #P2/?2 #  *  *  *  *  *  a  ®^Pi#P2»  •  •  *  »Pjj)  * 

*  A  A 

g  =  m  (q1/p1#q2/P2*  •  •  •  ^jj/Pjj)  #  and  <M y)  *  -In  y.  Then 

—  •  2^  _  A  —  A  ^  A 

s  =  m  (P1/Pi»P2/P2» • • • »Pk/pk)  solves 

k  i 

(2. 2) minimize:  £  w.  $(s.)  subject  to  I  w.(g.-s.)  <  0; 

i-l  1  k  3*1  3  3  ]  “ 

1  ±  i  <  k  and  \  w. (g.-s.)  =  0. 

j=l  J  J  J 

w* 

The  Fenchel  dual,  C  ,  of  C  is 


C  =  {u;  (u,v)  <  0  for  each  veC) 

i  w  “  k 

=  {u;  l  u.w.  <0;  1  <  i  <  k,  £  u.w.  =  0}. 
j=l  J  J  ~  “  j=1  3  D 

(cf.  Barlow  et  al.  (1972)  pp.  49).  Thus  (2.2)  becomes 


.  .  ,  r  w* 

minimize:  \  w.  $(s. )  subject  to  g-seC 

i=l  1  1 

and  by  Theorem  3.4  of  Barlow  and  Brunk  the  solution  to  (2.2)  is 
unique  and  is  the  projection  of  g  onto  the  cone  C.  Thus 


P  -  mp  Ew(q/mp|C)  =  p  E-(q/p|C). 


Theorem  2.2.  As  m-*00,  p  converges  almost  surely  to  p  provided 

P  >>  q. 

/v 

Proof .  By  the  strong  law  of  large  numbers,  p-*-p  a.s.  as 
Moreover,  Ew(x|C)  is  continuous  in  both  w  and  x  so  that 
p— *pEp (q/p | C)  a.s.  Using  the  LSA  to  compute  Ep(q/p|C),  one  sees 
that  since  p  >>  q,  M((l,...,i))  £  1  with  equality  for  i  *  k. 
Hence,  Ep(q/p) |C)  *  ek  where  ek  is  the  k-dimensional  vector  of 
ones  and  so  p  Ep(q/p) |C)  *  p. 

A 

In  the  two  sample  problem  let  q  denote  the  vector  of 
relative  frequencies  of  a  sample  of  size  n  from  the  q  population 


Proof .  Our  maximum  likelihood  estimation  problem  is  the 
same  as  the  one  described  by  (5.5),  (5.6)  and  (5.7)  in  Barlow 
and  Brunk  (1972)  and  they  have  shown  that  the  solution  also 
satisfies 


A.  *  A  A  A 

l  Pa  1  l  (mp.+nq.)/N  >  £  q. 
j=l  3  j=l  3  3  j=l  3 

for  i  =  l,2,...,k-l  with  equality  for  i  =  k.  Letting 

/S  /\  A  /S  A 

t  =  (pj/mp-^pj/nipj, . . .  fP^/mp^qj/nq^  . . .  ,qk/nqk)  these  restrictions 


are  equivalent  to 
i 


k+i 

=kn 

2k 


(2.4)  £  w.  (t.-h.)  >  0  and  £  w.  (h.-t.)  >  0;  i  =  l,2,...,k-l 

j=l  3  3  3  ~  j=k+l  3  3  3 


and  £  w,  (t.-h.)  =  0  =  £  w. 

j=l  3  3  ^  j=k+l  3 


'VV' 


From  Barlow  and  Brunk  (1972),  (2.4)  is  equivalent  to  h-teB 


w* 


Hence,  with  <j>  as  before  t=  (p1/mp1,p2/mp2r  •  •  •  »Pk/mp]c,q1/nq1, . . .  ,qk/uq^ 


solves : 


V'ii 


minimize  J  w.  <j»(t . )  subject  to  h-teBw*. 
i=l  1  1 

Appealing  to  Theorem  3.4  of  Barlow  and  Brunk  (1972)  again,  we 
have  that  (p,q)  =  w  Ew(h|B),  which  is  the  desired  conclusion. 

Since  membership  in  K  imposes  no  restrictions  between  the 
first  k  coordinates  and  the  last  k  coordinates  of  a  point, 

(E(- |b)1,E(* |B)2, - ,E(- |B)k)  and  (E(« |B)k+1, . . . ,E(* |®) 2k>  can 

be  computed  independently.  It  follows  that 


(2.5) 


P  -  P  E 


4"?")  _ 

;  ["SH  •  - 1  [  •’¥'4 


q  =  q  e 


where  C'  denote  the  cone  {x:  x^^  <_  x2  £  ...  ±  xk>. 

Theorem  2.4.  If  p  >>  q,  then  p[lim_  _  (p,q)  =  (p,q)l  =  1. 

lU  f  I*-*00 

Proof .  Since  Ew(g+ek|C)  =  Ew(g|c)  +  ek,  it  follows  from 

(2.5)  that 

A  A  A  /V 

(2.6)  p-p  =  (n/N)  p  E^(3^E|C)  and  q-q  =  - (m/N)  q  E^(3l£|c). 

P  p  4  q 

A  /s 

By  the  strong  law  of  large  numbers  p[limm  (p,q)  -  (p»q)l  *  1. 

in  f 

/v  /s. 

Since  (n/N)p  and  (ra/N)q  are  bounded  and  Ew(x|c)  is  continuous  in 
x  and  w,  we  need  only  show  that 


or  equivalently. 


Ep<a^|c>  >  Eq(3^E|C)  =  0, 


Vp|c>  -  -V flC>  *  v 


In  the  proof  of  Theorem  2.2,  it  was  shown  that  p  >>  q  implies  that 
E  (q/p|C)  ■  ek  and  the  proof  of  E  (-p/q|C)  ■  -ek  is  similar. 


»*»*.■  •fc'.fcjaflVaiiwBirtiiii 


It  is  interesting  to  observe  that  (p,q)  is  strongly 
consistent  for  (p,q)  for  any  sequence  of  sample  sizes  (m,n) 
provided  m  and  n  simultaneously  approach  ®. 

3.  TESTS  WITH  A  KNOWN  STANDARD:  ONE  SAMPLE  TESTS.  In  this 

and  the  next  section  we  use  X  generically  to  denote  the 

likelihood  ratio.  Suppose  q  is  known  and  that  we  have  a  random 

sample  of  size  m  from  the  p-population  and  consider  testing 

Hq :  p  ■  q  against  Hj^  -  Hq  where  p  >>  q.  Let  SQ1  *  -2  In  X  = 

-2m  l  p. (In  q.-ln  p. ) .  Since  Hn  is  a  boundary  point  of  H. 

i=l  1  1  1  0  1 

the  usual  limiting  chi-squared  results  for  •?  In  X  do  not  apply. 

However  the  next  result  shows  that  the  limit  distribution  is  a 

mixture  of  chi-squared  distributions.  Before  stating  the  result 

we  define  the  mixing  proportions.  Let  w  =  (w1#w2, . . . ,w^)  be 

positive  weights  and  let  W^,W2 , . . . ,Wk  be  independent  normal 

variables  with  zero  means  and  variances  . . .  ^w^1 

respectively.  We  denote  the  probability  that  Ew(w|C)  has  exactly 

l  distinct  values  (level  sets)  by  Pw(£,k). 

Theorem  3.1.  If  Hq  is  true  then  for  any  real  number  t 

PIS0l  i  *  l]ml  txiU  - 

2  .  2 
where  x.,  is  a  chi-squared  variable  with  v  degrees  freedom  (xft=0)  . 
V  u 

Proof .  Writing  a  second  order  Taylor's  expansion  for  In  q^ 

/v 

and  In  about  the  point  p^f  SQ1  can  be  expressed  as  follows: 
(3.1)  SQ1  *  l  ^  a^2(»€(pi-qi))2  -  l  ^  Pie^t^mCPi-Pi))2 

A  A 

where  a.  is  between  p.  and  q.  and  0^  is  between  p^  and  p^. 


imjn  tuf  m\ 
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Let  ui»u2'  * *  *  ,Uk  in<*ePendent  normal  variables  which  are 

centered  at  their  expectations  and  have  variances  p”1^”1, . . .  ,p“*, 

respectively.  Then  the  random  vector  ^m(p-p)  converges  in 

distribution  to  (p1<U1-U),  p2 (U2~U) , . . . ,pk (U^-U) )  where 

U  =  l  p.U. (cf.  Robertson  (1978)).  Hence,  appealing  to 
i=l  1  1 

Theorem  4.4  of  Billingsley  (1968),  we  have  that 

(✓m(p-p)  ,p,p,ot,  e)  ^  (p1(U1-U),...,pk(U]c-U),p,p,p,p) 

provided  Hq  is  true.  Thus,  under  Hq,  Sg^converges  in  distribution 
to 

)c  )c  ■— 

(3.2)  l  q.(U.-U)2  -  l  q.[E  (Ue.-U|C).]2. 

i=l  1  1  i=l  1  q  k  1 

«*,  «*» 

Now,  noting  that  E  (Ue.  -U|C)  =  Ue.  +  E  (-U|C),  squaring  the 

T  l»  •»  M 

binomials  in  (3.2),  combining  terms  and  using  Theorem  7.8  of 
Barlow  et  al.  (1972 \  (3.2)  can  be  rewritten  as 

\ml  1iIV"|c,i-"i>2 

where  =  -IT;  i  =  l,2,...,k.  Corollary  2.6  of  Robertson 
and  Wegman  (1978)  gives  the  desired  conclusion. 

If  q^  *  q2  *  •••  =  qk  *  k”1  then  the  P(£,k)  can  be  deter¬ 
mined  recursively  from  Corollary  B  on  p.  145  of  Barlow  et  al. 
(1972).  Their  Appendix  A5  gives  the  P(£,k)  for  k  £  12  in  this 
case.  However,  if  the  qA  are  not  all  equal  the  P(£,k)  are  much 
more  difficult  to  compute.  Equation  (3.23)  of  Barlow  et  al. 

(1972)  is  a  recursive  relation  from  which  one  can  obtain  the 
P(l,k)  provided  P(j,j)  is  known  for  j  <  k.  Barlow  et  al.  (1972) 
contains  closed  form  expressions  for  P(j,j)  with  j  <  4  and  the 
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tables  in  Abrahamson  (1964)  can  be  used  to  compute  P(5,5). 
Robertson  and  Wright  (1980)  have  obtained  bounds  for  certain 
chi-bar-squared  distributions.  Their  results  show  that 


(3.3)  lim^  P[S01  >  t]  <  (PtX^!  >  t]  +  P[x£_2  1  t])/2 

and  of  course,  one  could  obtain  a  conservative  test  using  the 
upper  bound  in  (3.3). 

Next,  we  consider  the  (one  sample)  likelihood  ratio  test 
of  versus  H2.  The  test  statistic  is 

k 

S12  =  -2  In  X  *  -2m  l  p^ln  Pi~ln  pil . 

Let  Pp (E)  denote  the  probability  of  the  event  E  computed  under 
the  assumption  that  p  is  the  population  vector  of  probabilities. 
Theorem  3.2.  For  any  p  satisfying  (ie.  p  >>  q)  and  for  all  t 

lim  P  IS.,  >  t)  <  lim  P  (S.,  >  t] 
p  12  —  —  q  12  — 

and 

Pq'S12  i  «  -  ltml  Pq«A)PI^.1  I  t). 

Proof .  Writing  a  second  order  Taylor's  expansion  for 

A 

In  about  the  point  p^  we  see  that  S12  can  be  written 

^  A 

(3.4)  l  p. 

i=l  1  1  11 

A 

where  is  between  p^  and  p^.  Now  we  want  to  obtain  the 
limiting  distribution  of  (3.4)  under  and  to  show  that  this 
limit  is  stochastically  largest  for  p  *  q.  Let  p  >>  q  and  let 
0  ■  Hq  <  <  . . .  <  hA  *  k  with  p^  +...•♦■  p^  »  q^  +...+  q^  for 

i  *  and  Pj^  ^  p  ^  ^  ^ . . .  ^  for  x  ^  ^2 '  *  *  * 
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By  the  strong  law  of  large  numbers,  for  almost  all  w  (in  the 
underlying  probability  space)  there  is  an  mQ (w)  and  an  e  >  0  for 

A  /V 

which  (q  +,  +...+  <3i)/(Pn  +1  +...+  p . )  <  1  -  e  for  each 

j  nj  ‘L  1 

j  =  0, . . . ,  A-l  and  i  >  Hj  with  i  +  n j+i • • • • > hA  and 

A  A 

(q_  +  •••  +  q_  )/(p_  +...+  p  )  >  1  -  e  for  each  0  <  j  <  t  <  k 

V  n-e  nj+l  n t  ~  * 

provided  m  >  m0(uj).  So  in  using  the  LSA  to  compute  E£  (q/p-ek|C) 
for  such  an  oo  and  m,  we  see  that  the  level  sets  are  of  the  form 
{ n j+1  ,  •  •  • ,  )  with  0  £  j  <  l  <_  k.  Consider  the  closed,  convex  cone 

D  —  {veC:  v.  ~  • • •  =  v  ,  v  , «  —  • • •  +  v  , • • ■ # v  _  s  • • •  =  v  } . 

l  r\1  r^+l  n2  n^_i+l  nA 

If  Ew(g|D)  denotes  the  projection  of  g  onto  D  with  respect  to  the 
distance  associated  with  (*,*)w,  then  for  such  w  and  m 


(3.5) 


E^(q/p-e.|C)  =  EMq/p-ejD) 


since  EMq/p-et|c)  eD.  One  way  to  compute  E  (g | D)  is  to  first 

P  K  W 

obtain  g*  which  is  constant  on  (n^+1, . . . ,n  +1)  by  setting 

wfV^D= J  +i  wf for  1  =  vi . Vi and 

j  =  0,1,..., A-l,  and  then  to  apply  the  LSA  to  g*  with  weights 

A  /S 

wlf...,wk.  If  g  =  q/p-ek  and  f  =  p/p-e^,  then  g*  =  f*  and  hence 

A  A  A  A  A 

EMq/p-e,  ]D)  =  E^ ( (p-p)/p | D) .  Clearly,  (^(p-p)  ,p,y)  converges 
P  *  P  _  ^  ^ 

in  distribution  to  (p^U^-U),  P2  (U2-U)  r  •  •  •  »Pk (Uk~U)  ,p,p)  with 

and  U  defined  as  before.  Using  (2.1),  we  see  that  (3.4) 
converges  in  distribution  to 

)c  3c 

(3.6)  l  p.  (E_(Ue.-u|D)  .)2  =  l  p.  (E  (W|D).  -  W) 2 
i=l  1  P  K  i  i=l  1  p  1 

k 

where  and  W.  =  -U.  for  i  =  l,2,...,k  and  W  *  £  p.  W. .  Since 
11  i=l 

Ep (W |d)  is  constant  on  thj+1, . . . , n^+1 }  for  j  =  0,1,...,A-1,  (3.6) 


A 


«  —  - 

can  be  written  as  l  qi(Ep{w|D)i  -  W)  .  To  compute  Ep(w|D) 

we  first  obtain  Wt  .  Z^+1  Pf  £«'  i  -  V1 . nj+1 

and  j  =  0,1,..., A-l.  Now,  if  T.  =  /p./q.  W. ,  T  *  j>  q.T.  and 
n  ■  n .  1  iii  i=i  1  1 

T*  =  then  (W*,...,W*)  has  the  same 

D  D  ~  k  k 

distribution  as  (TJ,...,T£).  Since  W  =  p.W*  =  7  q.Wt 

i  ^  I- —  1  l  l  i*“l  ^  ^ 

and  E  (w|d)  =  E  (W* | D)  =  E  (W*|D),  (3.6)  is  equal  in  distribution 

rk  P  -  2  q 

to  Z  q. (E  (T|D) .  -  T)  .  However, 
i=l  1  q  1 

<v  k  Jc  as, 

I  q±  (T.-T) 2  =  I  q^T.-E  (T|D).)2  +  l  q .  (E  (T|d).  -  T)2. 
i=l  1  1  i=l  1  1  q  1  i=l  1  q  1 

The  first  term  on  the  right  hand  side  of  the  previous  equation 

is  | |T-E  (T|d) | |  which  is  smallest  when  D  is  largest,  that  is 
4  4 

D  =  C,  which  occurs  if  p  =  q.  So  the  first  conclusion  of  the 
Theorem  is  established  and  it  follows  that  p  =  q  is  the 
asymptotically  least  favorable  distribution  in  in  the  sense 
that  the  probability  of  a  type  I  error  for  the  asymptotic  test  is 
largest  if  p  =  q.  The  second  conclusion  is  a  consequence  of  a 
result  due  to  Bartholomew  (cf.  Theorem  3.1  of  Barlow  et  al.  (1972)) 
and  the  proof  is  completed. 

As  we  have  noted  earlier,  the  computation  of  the  P(£,k)  may 
be  tedious  and  so  we  apply  the  bounds  for  chi-bar-squared 
distributions  given  in  Robertson  and  Wright  (1980)  to  obtain 


(3.7)  suppEHi  liPV.-  Pptsi2 


VS12  -  tJ  -  <fli»2'k+1  >>> *1-1  1  <=)• 


Of  course ,  the  upper  bound  in  (3.7)  can  be  used  to  determine  a 
conservative  asymptotic  test. 

If  q  is  known  one  might  want  to  test  Hn:  p  =  q  versus 
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H£:  q  >>  p.  However,  if  we  define 

p'  =  •  •  •  'P]^  and  q'  =  (qk»q]c_1, . . .  ,qx) 

then  p'  >>  q'  is  equivalent  to  q  >>  p.  So  the  tests  developed 
in  this  section  can  be  used  to  test  HQ:  p  -  q  versus  Hjs  q  >>  p 
and  versus  H2  with  q  known. 

A  A 

4.  TWO  SAMPLE  TESTS.  In  this  section  we  suppose  that  p  and  q 
are  the  relative  frequencies  of  successes  corresponding  to 
independent  random  samples  of  size  m  and  n  from  the  p  and  q 
populations  respectively.  We  first  consider  the  likelihood 
ratio  test  of  HQ  versus  -  Hq  where  H^:  p  >>  q.  The  test 
statistic,  -2  In  X,  can  be  expressed  as 

Jc  Ak  )c  /V 

(4.1)  TQ1  -  2m  l  p^lln  p.-ln  p?]  +  2n  £  qAln  q.-ln  q?] 

01  i=l  1  1  1  i*l  1  1  x 

0  o  A  A  _  _ 

where  pY  *  q.  =  (mp.+  nq, )/N;  i  *  l,2,...,k  and  p  and  q  are 

X  1  X  X 

given  by  Theorem  2.3. 

Theorem  4.1.  If  p  =  q,  then  for  each  real  t 

?IT01  i  11  *  ltml  Fp(i'k)PIXk-£  i  • 

Furthermore , 

s“Pp  =  q  PlT01  i  11  -  <P'Xk-l  1  «  +  PI*k-2  i  t,,/2' 

Proof .  Writing  a  second  order  Taylor's  expansion  for 

n  -  o 

In  pA  and  In  p£  about  pA  and  for  In  qj^  and  In  q ^  about  q^  we 

see  that  can  be  written  as  the  sum  of 
^  k 

(4 . 2)  J  P±  8“2  (^(pJ-Pi) ) 2  -  Pi  v”2(v1n(pi-pi))2 

and 

1c  1c 

(4.3)  J  »22(/5(q®-qi))2  -  ^  qt  a~2  (/rilq^q^)2 , 

A 
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0  ~  _  * 

where  8^(v^)  is  between  and  p^  (p^  and  p^)  and  p^(o^)  is 
0  ~  _  A 

between  qi  and  q^  (qA  and  qi).  Let  V  =  (vi »v2 , . . . ,Vk)  with  the 

independent  normal  variables  which  have  zero  means  and 

variances  .  • . , qk^,  respectively,  and  suppose  that  V  is 

independent  of  the  U  defined  in  the  previous  section.  If  we 
~  k 

set  V  *  £  q.V.,  then  as  m  and  n  simultaneously  approach  <» 

i=l  1  1 

(vin(p-p),  Sn( q-q))  5  (p^  (Uj-U)  , . .  .  ,pk  (Uk-U)  ,  qx  (Vj^-V)  , . . .  ,qk  (Vk-V) )  . 

A  A 

Furthermore,  since  pi  and  pi  (q^  and  q^)  are  strongly  consistent 
for  p^  (q^)  provided  p  >>  q,  it  follows  that,  with  probability  one, 

0  =  (0^,..*,8k)  P  ,  ^  =  { f  •  •  • ,  Vj^)  p,  p  *—  ( ,  *  •  * ,  Pk )  ~^  q  and 
o  =  (olf . . . ,0k)  ^  q. 

Let  p  =  q  and  m,n->®  so  that  m/N  -*■  ae[0,l].  Since  (4.2) 

A  AAA 

and  (4.3)  are  continuous  functions  of  (^(p-p)  ,  /n(q-q)  ,p,q,0,v,p,o)  , 
we  may  apply  the  weak  convergence  results  mentioned  earlier  to 
show  that  (4.2)  converges  in  distribution  to  the  product  of  (1-a) 
and 

(4.4)  l  p±{  I/a(Vi-V)-/I^a(Oi-U)]2-[Ep(/a(V-Vek)-/T:a(U-Uek)  | C)  ±]  2> 

and  (4.3)  converges  in  distribution  to  the  product  of  a  and  (4.4). 
Hence,  converges  to  (4.4),  which  can  be  written  as 

£k  p.  { I  (//aVi-/T:aU^)  +  (/I^aU-Zav) ) 2-  [E  (/aV-ZT^aUl  C) £+  (/I^aU-ZaV)  1 2). 
i=l  1  P 

Squaring  the  binomials  in  the  above  expression  and  applying 

Theorem  7.8  of  Barlow  et  al.  (1972),  this  expression  can  be  written  as 

(4.5) ^  Pi(W2  -  (Ep(W|C).)2}  =  Pi(Wi-Ep(W|C)i)2, 
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where  »i  =  U±  ~  NfO,??1)  and  W1#W2,...,Wk  are 

independent.  Since  the  limit  (expression  (4.5))  does  not  depend 
on  a,  Tqi  converges  in  distribution  to  (4.5)  for  any  sequence  of 
m  and  n's  which  both  approach  infinity  (cf.  Theorem  2.3  of 
Billingsley  (1968)).  As  we  have  seen  earlier  (4.5)  has  the 
chi-bar-squared  distribution  stated  in  the  first  conclusion  of 
the  theorem.  The  second  conclusion  follows  from  the  results 
given  in  Robertson  and  Wright  (1980). 

In  this  two  sample  situation  the  vector  p  is  not  specified 
by  Hq.  One  could  use  p^  ■  (p^».*.#Pk)  as  an  estimate  of  the 
unknown  p  and  compute  the  P(£,k)  based  on  this  estimate.  The 
use  of  the  resulting  chi-bar-squared  distribution  would  provide 
an  approximate  large  sample  test.  Or,  if  one  wanted  an  asymptotic 
test  with  size  a,  the  test  could  be  based  on  the  second  conclusion 
of  Theorem  4.1,  that  is  the  critical  value,  C,  could  be  chosen 
to  satisfy  PfXj^  >  C]  +  Plx£_2  -  C1  “  2ot- 

Next,  consider  a  likelihood  ratio  test  of  versus  H2  =  ~H^. 
The  test  statistic,  T12  =  -2  In  X,  can  be  written  as 

k  A  A  k  A 

(4.6)  T.-,  =  -2m  l  p.  Iln  p.-ln  p.]  -2n  £  q.  [In  q.-ln  q .  ] . 

i=l  1  1  1  i=i  i  x  i 

Theorem  4.2.  If  P_  _(E)  denotes  the  probability  of  the  event  E 

-  p,  q 

computed  under  the  assumption  that  p  and  q  are  the  values  of  the 
parameters,  then  for  each  real  t 


Proof.  Writing  a  second  order  expansion  for  In  p,  (In  q.) 

A  A 

about  the  point  (q^),  (4.6)  becomes 

(4.7)  l  p.  tT2  (v^n(pi-p. )  )2  +  l  q.  4»T2  (✓n  (q.-q.) )  2 

i=l  1  1  i=l  1  1  11 

A  /V 

where  x^($^)  is  between  p^  and  p^  (q^  and  q^.  Expressing 

A  A 

j>i~Pi  and  qi-qi  in  terms  of  projections,  we  see  that  (4.7)  becomes 

K  a  a 

(4.8)  (n/N)  l  p?  tT2  (/mn7N  E-(St£|C).)2 

i=l  p  *  * 

+  (nj/N)  l  q*  <r2(/mn7K  E-(3lE|C)  .)2. 

A  i  Vi  —  * 


Let  p  >>  q,  let  nQ  *  0  and  suppose  that  ^  <  • *  *  <  nA  are 

those  integers  ie{l,2,...,k)  for  p^^  +...+  p.^  =  q^^  +...+  q^.  By 
the  strong  law  of  large  numbers,  for  almost  all  o>  (in  the 
underlying  probability  space)  there  is  an  e  >  0,  an  m^(u)  and 

/V  A  A  A 

an  nQ(u>)  for  which  (q^  +1  +...+  q^/fp^  +1  +...+  p^  <  1  -  e  for 
each  j  =  0 , . . . , A-l  and  i  >  n j  with  i  +  nj+i' • • • »hA  and 

A  A  /S  A 

(q„  4.1  +...+  q„  )/(p„  +1  +...+  Pn  )  >  1  -  e  for  each  0  <  j  <  l  <  A 
nj+1  T)z  r)t 

provided  m  mQ(u)  and  n  >  n0(u>).  An  argument  like  that  given  in 
the  one  sample  problem  to  establish  (3.5)  shows  that 

A  A  A  A  /V  A  A  /\ 

eM2IR|C)  =  e-(3^|d)  and  E-(StH|C)  =  em2=£|d). 
pp  Pp  **  q  qq 

If  one  considers  the  algorithm  for  computing  Ew(g|D)  discussed 
in  the  last  section,  then  it  is  clear  that 

E/'(2I£|d)  =  E~  (-^lSLlliElEi|D)  and  E- (3i£|d)  =  E~  ( "(PrP)  )p) . 


Hence,  if  m,n+»  with  m/N  -►  ae(0,l],  then  (4.8)  converges  to 
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k  qVa  (V-Ve.  )  -p/I^a  (U-Ue.  )  , 

(4.9)  <l-a)  l  p  IE  ( - - Md,  412  + 

i-1  1  p  P  x 

„k  q^a(V-Ve.)-p/I^a(U-U)  , 

1  ^  qi IEq * - S - "»!> 

where  U,V,U  and  V  are  defined  as  before.  Again  it  is  clear  that 
Ep (f/p | D)  =  Eg (f/q | D)  for  any  f  defined  on  (1,2,. .  ,,k}  and  since 
Ew(*|D)  is  constant  on  In j+1, . . . ,Hj+1}  for  j  =  0,...,A-1,  (4.9) 
can  be  written  as 


(4.10) 


k  q/a(V-Vev)-p/T=a(U-Ue.  )  0 

liml  qi‘V - *-q - — !«»ii  - 


Let  T.  =  /p./q.  U.  for  i  =  1,2,... ,k  and  T  *  £  q.T. .  For 

i*l  1  1 

^  *■  find  ^  =  0  /  •  •  •  /  A-l  f  let 

Ui  *  ^f®rij+l  Pl 

and 

Ti*  -  1  ’(V^yi  V 

Clearly  (V1# . . . ,Vfc,Uj, . . . ,0J)  =  (V1# . . . ,Vfc,TJ, . . . ,TJ)  and 


*  Pt/t=S<V5» 

-  *  ^iui> 

e  ^  qt<V',>  -  ^  i^J+1  <’iTi| 

-  E^+i  q,c-«<vrv)  -  /r^(Tt-i)]. 

k 

Hence,  if  we  define  W,  *  /a  V.  -  /1-a  T.  and  W  *  £  q.W. ,  then 

-1  1-1 
VT  ~  N(0,qT'L)  and  W^,...,!^  are  independent.  If  we  consider  the 

computational  algorithm  for  Ew(g|D)  discussed  earlier,  then  we 

see  that  (4.10)  has  the  same  distribution  as 


A 
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k  ~  0 

(4.11)  l  q,[E  <w|D) .  -  w]\ 

i=l  1  q  1 

Since  (4.11)  does  not  depend  on  a,  converges  in  distribution 

to  (4.11)  for  any  sequence  of  m  and  n's  which  simultaneously 

approach  ®.  As  before,  (4.11)  is  made  stochastically  largest 

by  setting  D  =  C  or  p  =  q.  So  the  first  conclusion  of  the 

Theorem  is  established.  In  this  case,  that  is  p  =  q  or  D  =  C, 

(4.11)  has  a  chi-bar- squared  distribution  with  tail  probabilities 
k  2 

£  P  (£,k)P[Xp  i  >  t]  for  all  real  t.  The  second  conclusion 

1=1  P  l~1  ~ 

of  the  Theorem  follows  from  Theorem  1  and  Remark  2  of  Robertson 
and  Wright  (1980). 

5.  SUMMARY.  We  outline  below  the  procedures  that  have  been 
developed  here  for  testing  HQ:  p  *  q  vs.  -  Hq  where  p  >>  q 
and  11^  vs.  ■  -H^. 

A 

I.  One  sample  tests:  known  standard.  (p  is  the  relative 
frequency  estimate  of  p  based  on  a  sample  of  size  m  and  q  is 
known) . 

A  A 

A.  M. l.e.  of  p  subject  to  p  »  q:  p  *  p  E^(q/p|c)  where 
Ep ( * | C )  can  be  computed  by  the  LSA  or 

8  8 

p.  *  p.  minllal.  max^^  q^Z  Pyi  -  1,2,. ..,k. 

B.  Test  of  Hq  vs.  -  Hq. 

k  /v 

(1)  Test  statistic:  SQ1  =  -2  In  X  =  2m  £_  Pi  [In  p.-ln 

i**l  * 

(2)  Null  distribution: 

14V~  pts01  >  t]  -  PqU.k>P[x£_t  >  t). 
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C.  Test  of  H.  vs.  H, 

*  I. 

IV  A  A 

(1)  Test  statistic:  S.~  =  -2  In  X  =  2m  £  p. [In  p.-ln  p .  J 

lz  i=l  1  1  1 

(2)  Null  distribution. 

supp  »  q  linV*.  P>S12  i  *  ltml  pq«'k)I'IXf.ii‘) 

A  A 

II.  Two  sample  tests (p  and  q  are  independent  relative  frequency 
estimates  of  p  and  q  based  on  samples  of  size  m  and  n  respectively) 


AAA 


A.  M.l.e.  of  p  and  q  subject  to  p  »  q:  p  =  (p[N)E^( (mp+nq)/p|C) 


AAA 


and  q=-(q/N)  E~ (- (mp+nq)/q |C)  with  the  projections  computed 
by  the  LSA  or 

g  ^  g 

P±  «  (Pi/N)  «ninllali  max.^  Pr 


r- 

B 


q.  =  (qi/H)  »axliall  J  (mp^nq^/I  qj. 


B.  Test  Of  Hq  VS.  Hj  -  Hp 
(1)  Test  statistic:  T 


01 


-2  In  X  = 
k 


2m  l  p. [In  p.-ln  p . )  +  2n  l  q. [In  q.-ln  q.] 
i=l  1  1  1  i=l  1  1  1 


with 


pj  -  qj  -  (mpj+nq^/N. 

(2)  Null  distribution 

SUPP  .  q  U%,,n™pIT0l  >  tj  =  (PIx^  I  tl  ♦  P(X*_j  I  t])/2 
C.  Test  of  vs. 

(1)  Test  statistic:  T12  =  -2  In  X  = 

k  a  ^  k  /v  _ 

2m  l  p^ln  p^-ln  p^  +  2n  l  qi[ln  qj-ln  qil 
i*l  i=l 

(2)  Null  distribution: 


sup 


»  q  liVn~  PlT12  i  11  *  Jw 


.k-1. ,-k+l 
l*-l,z 


H  1  t). 


21 


REFERENCES 

Abrahamson,  I.  G.  (1964).  Orthant  probabilities  for  the  quadri- 

variate  normal  distribution.  Ann.  Math.  Statist.  35  1685-1703. 

Barlow,  R.  E.,  Bartholomew,  D.  J.,  Bremner,  J.  M.  and  Brunk, 

H.  D.  (1972).  Statistical  Inference  Under  Order  Restrictions. 
Wiley,  New  York. 

Barlow,  R.  E.  and  Brunk,  H.  D.  (1972).  The  isotonic  regression 
problem  and  its  dual.  J.  Amer.  Statist.  Assoc.  67  140-147. 

Billingsley,  Patrick  (1968).  Convergence  of  Probability  Measures. 
Wiley,  New  York. 

Brunk,  H.  D.  (1965).  Conditional  expectation  given  a  o-lattice 
and  applications.  Ann.  Math.  Statist.  36  1339-1350. 

Chacko,  V.  J.  (1966).  Modified  chi-square  test  for  ordered 
alternatives.  Sankhya  (B)  2j)  185-190. 

Robertson,  Tim  (1978).  Testing  for  and  against  an  order 

restriction  on  multinomial  parameters.  J.  Amer.  Statist.  Assoc. 
73  197-202. 

Robertson,  Tim  and  Wegman,  Edward  J.  (1978).  Likelihood  ratio 
tests  for  order  restrictions  in  exponential  families.  Ann. 
Statist.  6  485-505. 

Robertson,  Tim  and  Wright,  F.  T.  (1980).  Bounds  on  mixtures  of 

distributions  arising  in  order  restricted  inference.  (Submitted] 


UNCLASSIFIED 


SECURITY  CLASSIFICATION  OF  THIS  AA4I  TOTMo  Omm  faMM) 


REPORT  DOCUMENTATION  PAGE 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


RCCIPItMrt  CATALOO  NUMSCA 


♦-  TITLE  (m*  AiSWII.J  . .  . ....  i 

LIKELIHOOD  JgtATIO  TESTS  FOR  AND  AGAINST 
A  jgTOCHASTIC  ^RDERING  BETWEEN 
MULTINOMIAL  POPULATIONS ;  Tests  for  a 

efnnhacf  nr/^orinn  ^  ^ 


imTT"TTT 


Dept,  of  Statistics /  Dept,  of  Mathematics 
University  of  Iowa  Univ.  of  MO-Rolla 
Iowa  City,  Iowa  52242  Rolla,  MO.  65401 


II.  CONTNOLLINO  OFFICR  NAME  AND  AOORESS  ' 

Office  of  Naval  Research 
Statistics  and  Probability  Program  Code  43 
Arlington,  Virginia  2227/ 


SECURITY  CLASS.  ( 


IS.  (MSTRIRUTlON  STATEMENT  (»t  Ml 


APPROVED  FOR  PUBLIC  RELEASE:  DISTRIBUTION  UNLIMITEO. 


17.  CHSTfflflUTtON  STATKttCMT  (mi  mm  UrtfH  mtmem*  m  WmA  fft,  if  mtfrnmt  ftmm  Mm^mtt) 


It.  KEY  WORM  fCwlhil  m  WfifN  . . .  If  NHl 


Stochastic  ordering,  multinomial  distributions, 
chi-bar-squared  tests,  likelihood  ratio,  asymptotic  results. 


SS.  ASST  PACT  fCwS—  M  itwm  It*  M  I 


PLEASE  SEE  REVERSE  SIDE. 


•  jam  n  1473  toineN  of  *  nov  ss  «s  «— qlete 

*/N 


rr  classi  FIC  AT 


aSIFIED 


ABSTRACT 


-  *  UNCLASSIFIED 

WOt  Ml  TV  CUAtimCATIOM  Q?  TWO  (Wf  Bmm  £ 


Likelihood  ratio  tests  concerning  the  parameters  of  two 
multinomial  populations  are  discussed.  A  stochastic  ordering 
restriction  is  considered  as  a  one  sided  alternative  to 
equality.  The  one  and  two  sample  tests  for  equality  versus 
stochastic  ordering  and  stochastic  ordering  versus  all 
alternatives  are  derived  and  their  large  sample  distributions 
are  obtained.  The  large  sample  distributions  are  mixtures  of 
chi-squared  distributions.  The  tests  developed  provide 
discrete  analogues  for  the  one  sided  Mann-Whitney-Wilcoxin 
and  Kolmogorov-Smirnov  tests. 


s  v  j:  *:. 

UNCLASSIFIED 

*te«i»tvv  Ck*MiriCAVt««  gp  v**  am*  fcwm 


